feat: Add Apache Spark integration to OpenServerless #188
Open
mobs75 wants to merge 11 commits into apache:main
- Add Spark operator build and test tasks in task project
- TaskfileBuild.yml for GHCR image building
- TaskfileTest.yml for SparkJob testing
- Sync with openserverless-task fork feature/enable-spark-in-whisk
…dd-spark-operator branch
- Update operator submodule to include Spark integration (commit afc74b4)
- Add comprehensive Spark deployment support
- Enable Spark Master, History Server, and Worker management
- Integrate with MinIO for Spark event logs storage
- Add Spark operator build tasks
- Add Spark testing workflows
- Update Whisk CRD with Spark configuration
Author
Hi, thanks for the review. Could you tell me exactly which aspects of my Spark operator don't meet the project's standards? Specifically, I'd be interested in knowing whether it's the directory structure, naming, deployment methods, CLI integration, documentation, or anything else, so I can make targeted corrections and bring my work in line with the already accepted operators. Thanks!
Summary
This PR adds Apache Spark integration to OpenServerless, so that users can deploy and manage Spark clusters alongside their serverless workloads for big data processing.
Architecture
The integration is implemented in the operator submodule and follows OpenServerless patterns:
Key Features
Spark Components
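The commit notes mention a Spark Master, Workers, a History Server, and MinIO as storage for Spark event logs. As a rough illustration of how those pieces usually connect, the sketch below builds the Spark properties that point event logging and the History Server at an S3-compatible bucket. The property keys are standard Spark and Hadoop S3A settings; the function names, the endpoint `http://minio:9000`, the bucket `spark`, and the prefix `spark-events` are hypothetical and are not taken from this PR. Credentials are deliberately left out.

```python
from typing import Dict, List


def event_log_conf(endpoint: str, bucket: str, prefix: str = "spark-events") -> Dict[str, str]:
    """Build Spark properties for writing event logs to an S3-compatible store.

    All argument values are illustrative assumptions; the operator in this PR
    may configure these properties differently.
    """
    log_dir = f"s3a://{bucket}/{prefix}"
    return {
        # Applications write their event logs here.
        "spark.eventLog.enabled": "true",
        "spark.eventLog.dir": log_dir,
        # The History Server reads completed application logs from the same place.
        "spark.history.fs.logDirectory": log_dir,
        # MinIO is typically addressed with path-style URLs on a custom endpoint.
        "spark.hadoop.fs.s3a.endpoint": endpoint,
        "spark.hadoop.fs.s3a.path.style.access": "true",
    }


def to_submit_args(conf: Dict[str, str]) -> List[str]:
    """Flatten a property dict into repeated `--conf key=value` arguments."""
    args: List[str] = []
    for key in sorted(conf):
        args.extend(["--conf", f"{key}={conf[key]}"])
    return args


if __name__ == "__main__":
    # Hypothetical endpoint and bucket, for demonstration only.
    print(" ".join(to_submit_args(event_log_conf("http://minio:9000", "spark"))))
```

Using the same directory for `spark.eventLog.dir` and `spark.history.fs.logDirectory` is what lets the History Server display jobs after they finish.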
Technical Implementation
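One detail that survives in this section is the mapping between Kubernetes memory quantities and JVM sizes (`1Gi` ↔ `1g`). A minimal sketch of such a conversion follows; it is not the code from `nuvolaris/spark.py`, the function names are hypothetical, and it assumes only whole-number binary quantities (`Ki`, `Mi`, `Gi`, `Ti`) need to be supported. Decimal quantities such as `1G` are rejected because they do not correspond exactly to a JVM binary size.

```python
import re

# Kubernetes binary suffixes mapped to the JVM/Spark-style suffixes.
_K8S_TO_JVM = {"Ki": "k", "Mi": "m", "Gi": "g", "Ti": "t"}


def k8s_to_jvm_memory(quantity: str) -> str:
    """Convert a Kubernetes quantity such as '1Gi' to a JVM size such as '1g'."""
    match = re.fullmatch(r"(\d+)(Ki|Mi|Gi|Ti)", quantity.strip())
    if match is None:
        raise ValueError(f"unsupported Kubernetes memory quantity: {quantity!r}")
    amount, suffix = match.groups()
    return f"{amount}{_K8S_TO_JVM[suffix]}"


def jvm_to_k8s_memory(size: str) -> str:
    """Convert a JVM size such as '512m' to a Kubernetes quantity such as '512Mi'."""
    match = re.fullmatch(r"(\d+)([kmgt])", size.strip().lower())
    if match is None:
        raise ValueError(f"unsupported JVM memory size: {size!r}")
    amount, suffix = match.groups()
    return f"{amount}{suffix.upper()}i"
```

For example, a worker pod limited to `2Gi` would be paired with a JVM setting of `2g` under this mapping; in practice the JVM value is often set somewhat lower than the pod limit to leave room for non-heap memory.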
… 1Gi ↔ JVM 1g)
Changes
Operator Submodule (commit afc74b4)
nuvolaris/spark.py - Complete Spark operator implementation
… patcher.py, main.py)
Configuration
Testing
Tested on MicroK8s cluster:
… spark://spark-master:7077)
Verification
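As one possible verification step, the sketch below checks that the Spark master URL reported above (`spark://spark-master:7077`) is well formed and that its port accepts TCP connections. It uses only the Python standard library, the function names are hypothetical, and it assumes the check runs somewhere the `spark-master` service name resolves, for example inside a pod in the same namespace. It is not the verification procedure used in this PR.

```python
import socket
from typing import Tuple
from urllib.parse import urlparse


def parse_master_url(url: str) -> Tuple[str, int]:
    """Split a 'spark://host:port' URL into (host, port)."""
    parsed = urlparse(url)
    if parsed.scheme != "spark" or parsed.hostname is None or parsed.port is None:
        raise ValueError(f"not a spark://host:port URL: {url!r}")
    return parsed.hostname, parsed.port


def master_reachable(url: str, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to the master's host and port succeeds."""
    host, port = parse_master_url(url)
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers DNS failures, refused connections, and timeouts.
        return False


if __name__ == "__main__":
    url = "spark://spark-master:7077"
    print(url, "reachable" if master_reachable(url) else "not reachable")
```

A successful TCP connection only shows that something is listening on the port; confirming that workers have registered would still require the master web UI or a test job.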
Use Cases
Future Enhancements
Documentation
User documentation and examples to be added in follow-up PRs.
Related Issues: Closes #[issue-number]
Operator Submodule PR: mobs75/openserverless-operator#[pr-number]